A multi-channel speech enhancement framework for robust NMF-based speech recognition for speech-impaired users

نویسندگان

Gert Dekkers

Toon van Waterschoot

Bart Vanrumste

Bert Van Den Broeck

Jort F. Gemmeke

Hugo Van hamme

Peter Karsmakers

چکیده

In this paper a multi-channel speech enhancement framework for distant speech acquisition in noisy and reverberant environments for Non-negative Matrix Factorization (NMF)-based Automatic Speech Recognition (ASR) is proposed. The system is evaluated for its use in an assistive vocal interface for physically impaired and speech-impaired users. The framework utilises the Spatially Pre-processed Speech Distortion Weighted Multichannel Wiener Filter (SP-SDW-MWF) in combination with a postfilter to reduce noise and reverberation. Additionally, the estimation uncertainty of the speech enhancement framework is propagated through the Mel-Frequency Cepstrum Coefficients (MFCC) feature extraction to allow for feature compensation in a later stage. Results indicate that a) using a trade-off parameter between noise reduction and speech distortion has a positive effect on the recognition performance with respect to the well-known GSC and MWF and b) the addition of a postfilter and the feature compensation increases performance with respect to several baselines for a non-pathological and pathological speaker.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Combined Multi-Channel NMF-Based Robust Beamforming for Noisy Speech Recognition

We propose a novel acoustic beamforming method using blind source separation (BSS) techniques based on non-negative matrix factorization (NMF). In conventional mask-based approaches, hard or soft masks are estimated and beamforming is performed using speech and noise spatial covariance matrices calculated from masked noisy observations, but the phase information of the target speech is not adeq...

متن کامل

Multi-Constraint Nonnegative Matrix Factorization Approach to Speech Enhancement with Nonstationary Noise

The enhancement of speech degraded by nonstationary noises and low signal-to-noise ratio (SNR) conditions is a high demanding but challenging task. We present a robust and effective single channel speech enhancement algorithm under the framework of Nonnegative Matrix Factorization (NMF). Considering the sparse property of speech and low-rank property of nonstationary noise, a new formulation fo...

متن کامل

Single-channel speech enhancement based on non-negative matrix factorization and online noise adaptation

In this paper, we demonstrate a simulator for real-time speech enhancement based on a non-negative matrix factorization (NMF) technique. In particular, we propose an online noise adaptation method in an NMF framework, which is activated during non-speech intervals and used for adapting noise bases for NMF. Thus, incoming noisy speech is decomposed by using such adapted noise bases and universal...

متن کامل

A New Method for Speech Enhancement Based on Incoherent Model Learning in Wavelet Transform Domain

Quality of speech signal significantly reduces in the presence of environmental noise signals and leads to the imperfect performance of hearing aid devices, automatic speech recognition systems, and mobile phones. In this paper, the single channel speech enhancement of the corrupted signals by the additive noise signals is considered. A dictionary-based algorithm is proposed to train the speech...

متن کامل

Speech Enhancement by Modified Convex Combination of Fractional Adaptive Filtering

This paper presents new adaptive filtering techniques used in speech enhancement system. Adaptive filtering schemes are subjected to different trade-offs regarding their steady-state misadjustment, speed of convergence, and tracking performance. Fractional Least-Mean-Square (FLMS) is a new adaptive algorithm which has better performance than the conventional LMS algorithm. Normalization of LMS ...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2015

A multi-channel speech enhancement framework for robust NMF-based speech recognition for speech-impaired users

نویسندگان

چکیده

منابع مشابه

Combined Multi-Channel NMF-Based Robust Beamforming for Noisy Speech Recognition

Multi-Constraint Nonnegative Matrix Factorization Approach to Speech Enhancement with Nonstationary Noise

Single-channel speech enhancement based on non-negative matrix factorization and online noise adaptation

A New Method for Speech Enhancement Based on Incoherent Model Learning in Wavelet Transform Domain

Speech Enhancement by Modified Convex Combination of Fractional Adaptive Filtering

عنوان ژورنال:

اشتراک گذاری